Abstract: In this article, we count the number of return words
in some infinite words with complexity $2n+1$. We also consider
some infinite words given by codings of rotation and interval
exchange transformations on $k$ intervals. We prove that the number
of return words over a given word $w$ for these infinite words
is exactly $k$.
1 Introduction

The starting point of this article is the following question: does there exist a
way to characterize by return words the infinite words with complexity $2n+1$?
We first give the definition of return words for a recurrent infinite word over
a finite alphabet (a recurrent word is an infinite word in which each factor
appears infinitely many times). Considering each occurrence of a word $w$ in
a recurrent infinite word $U$, we define the set of return words over $w$ to be the
set of all distinct words beginning with an occurrence of $w$ and ending exactly
before the next occurrence of $w$ in the infinite sequence. This mathematical
tool was introduced independently by Durand, Holton and Zamboni in order
to study primitive substitutive sequences (see ||l^, |ll|, [l^]). This notion is 
quite natural and can be seen as a symbolic version of the first return map 
for a dynamical system. Recently, many developments of the notion of 
return words have been given. For example, Allouche, Davison, Queffélec
and Zamboni study the transcendence of Sturmian or morphic continued
fractions, and the main tool is to show, using return words, that arbitrarily
long prefixes are "almost squares" (see [^). We can also use return words
to study low complexity infinite words. For example, the author shows
that an infinite word is Sturmian if and only if for each word $w$ appearing
in the infinite word, the cardinality of the set of return words over $w$ is
exactly two (see [^]). Recall that Sturmian infinite words are aperiodic
words with complexity $p(n) = n + 1$ for all $n$ (the complexity function
$p(n)$ counts the number of distinct factors of length $n$ in the infinite word)
(see m, ^, 15]). Fagnot and Vuillon give a generalization of the
balance property for Sturmian words, and the proof is based on return
words and combinatorics on words (see [13]). Cassaigne uses this tool to
investigate a conjecture of Rauzy (see [^).

Our purpose is to compute the set of return words for a class of infinite
words with complexity $p(n) = 2n + 1$ for all $n$ (see |^, ^, |, |9|, |l^, |l^, \i%).

The following question is thus natural: is it true that for these infinite
words, the cardinality of the set of return words over a factor $w$ is always
three?



In this direction, Justin and Vuillon [17] show that Arnoux-Rauzy infinite
words [^ , which are infinite words with complexity $2n + 1$ (see [^, Q, |^, |l^ ),
have property $R_3$. In the sequel, we say that an infinite word has the
property $R_n$ if the cardinality of the set of return words over each factor $w$
is exactly $n$. They also give the structure of return words in the context of
Episturmian words [12, 17].



On the other hand, the work of Ferenczi [14] presents a nice substitutive
infinite word with complexity $p(n) = 2n + 1$ given by the Chacon
transformation. It is not difficult to show that the cardinality of the set of return
words may be greater than three for this infinite word. This is the first
example of an infinite word with complexity $2n+1$ without the property $R_3$.

Nevertheless, we find two other classes of infinite words with complexity
$p(n) = 2n+1$ having the property $R_3$. The first one is the coding of a rotation
by angle $\alpha$ on the unit circle with three intervals (with rationally independent
lengths), one of these intervals having length $\alpha$. The second one is a
generalization of this class, namely the interval exchange transformations on three
intervals.

The structure of the article is the following. First, we recall some basic
definitions in combinatorics on words. Secondly, we present the infinite words
given by codings of rotation. Then, we show that an infinite word given by
a regular interval exchange with three intervals (resp. $n$ intervals) has the
property $R_3$ (resp. $R_n$). We also compute the length of each return word
over a given finite word $w$. Finally, we show that the coding of a rotation
on the unit circle with three intervals (resp. $n$ intervals) with rationally
independent lengths has property $R_3$ (resp. $R_n$).

More generally, we conjecture that the class of infinite words with complexity
$p(n) = 2n + 1$ and exactly three return words over each factor is the class of
infinite words given by a "self-induced discrete dynamical system". In
particular, we may extend these results to bounded remainder sets introduced
by Rauzy and Ferenczi.

2 Basic definitions and examples 

Let $A = \{0, 1\}$ be a binary alphabet. We denote by $A^*$ the set of finite
words on $A$ and by $A^{\omega}$ the set of one-sided infinite words. A word $w$ is a factor
of a word $x \in A^*$ if there exist some words $u, v \in A^*$ such that $x = uwv$. An
infinite word $U$ is called recurrent if every factor of $U$ appears infinitely many
times in $U$. For a finite word $w = w_1 w_2 \cdots w_n$, the length of $w$ is denoted
by $|w|$ and is equal to $n$. The set of factors of $U$ with length $n$ is denoted
by $L_n(U)$. The language $L(U) = \bigcup_n L_n(U)$ is the set of factors of $U$. For two
finite words $w$ and $u$, the number of occurrences of $w$ in $u$ is denoted by
$|u|_w$ and

$$|u|_w = \mathrm{Card}\{\, i \mid 0 \le i \le |u| - |w| \text{ s.t. } u_i u_{i+1} \cdots u_{i+|w|-1} = w_1 w_2 \cdots w_{|w|} \,\}.$$

The position set $i(U, w) = \{i_1, i_2, \ldots, i_k, \ldots\}$ of the word $w$ is the set of
integers where $i_k$ represents the position of the
first letter of the $k$-th occurrence of the word $w$ in the infinite word $U$. In a
more formal way, $i_k \in i(U, w)$ if and only if $U_{i_k} U_{i_k+1} \cdots U_{i_k+|w|-1} = w$ and
$|U_1 \cdots U_{i_k+|w|-1}|_w = k$. Since the infinite word $U$ is recurrent, the set $i(U, w)$
is infinite. For a recurrent word $U$, the set of return words over $w$ is the set
(denoted by $\mathcal{H}_{U,w}$) of all distinct words with the following form:

$$U_{i_k} U_{i_k+1} \cdots U_{i_{k+1}-1}$$

for all $k \in \mathbb{N}$, $k > 0$. This definition is best understood on an example. Let
$U_1 = (0100100001)^{\omega}$ be an infinite word on the alphabet $A$. By definition,
the set of return words over 01 is $\mathcal{H}_{U_1,01} = \{010, 01000, 01\}$. Indeed, the
infinite word $U_1$ can be written

$$(\underline{0}10\underline{0}1000\underline{0}1\underline{0}10\underline{0}1000\underline{0}1)^{\omega}$$

where the underline denotes the position of the first letter for each occurrence
of the word 01. We note that between two consecutive occurrences of 01, we have the
three possible words 0, 000, $\varepsilon$. Thus 010, 01000 and 01 are elements of $\mathcal{H}_{U_1,01}$.
We say that an infinite word has the property $R_n$ if the cardinality of the
set of return words over each factor $w$ is exactly $n$ (i.e. $\mathrm{Card}(\mathcal{H}_{U,w}) = n$
$\forall w \in L(U)$).
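
As an illustration outside the original argument, the definition can be checked mechanically on a finite prefix. The sketch below is written in Python; the function names `occurrences` and `return_words` are hypothetical, and only return words fully visible in the chosen prefix are reported, so on a general infinite word the result is a subset of $\mathcal{H}_{U,w}$.

```python
def occurrences(u: str, w: str) -> list[int]:
    """Start positions of all (possibly overlapping) occurrences of w in u."""
    return [i for i in range(len(u) - len(w) + 1) if u.startswith(w, i)]


def return_words(u: str, w: str) -> set[str]:
    """Return words over w that are visible in the finite word u.

    Each return word starts at an occurrence of w and stops just
    before the next occurrence of w.
    """
    pos = occurrences(u, w)
    return {u[i:j] for i, j in zip(pos, pos[1:])}


# Prefix of U_1 = (0100100001)^omega; ten periods is an arbitrary choice.
prefix = "0100100001" * 10
print(sorted(return_words(prefix, "01")))  # ['01', '010', '01000']
```

On this periodic example the prefix already contains every return word over 01, so the output agrees with the set given above.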



The complexity function $p : \mathbb{N} \to \mathbb{N}$ of an infinite word $U$ counts the
number of distinct factors of $U$ of given length:

$$p(n) = \mathrm{Card}\{ w \mid w \in L_n(U) \}.$$
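
A minimal Python sketch of this count on a finite prefix follows; the name `complexity` is hypothetical. On a finite prefix the value is only a lower bound for $p(n)$, since factors that first occur later are not seen.

```python
def complexity(u: str, n: int) -> int:
    """Number of distinct factors of length n occurring in the finite word u."""
    return len({u[i:i + n] for i in range(len(u) - n + 1)})


# Prefix of the periodic word U_1 = (0100100001)^omega used above.
prefix = "0100100001" * 10
print([complexity(prefix, n) for n in (1, 2, 10)])  # [2, 3, 10]
```

For this periodic word the count stabilizes at the period length 10, in contrast with the unbounded complexities $n+1$ and $2n+1$ studied in the article.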

3 A negative result 

Let us consider the following substitution, extensively studied by Ferenczi (see
[p^ ), which is a recoding of the Chacon substitution:

$$\sigma(1) = 12, \qquad \sigma(2) = 312, \qquad \sigma(3) = 3312.$$

The fixed point $\sigma(x) = x$ of the substitution begins with

$$x = 1231233121231233123312123121231233121231233123 \cdots$$

It is easy to check on the first few terms of the fixed point $x$ that the number of
return words over 23 is greater than 3; indeed:

$$\mathcal{H}_{x,23} = \{231, 2331, 23121, 233121\}.$$
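
The check can be reproduced with a few lines of Python. This is a sketch under stated assumptions: the helper names `iterate` and `return_words` are hypothetical, and the fixed point is approximated by the prefix $\sigma^4(1)$ of length 41, which is legitimate because $\sigma(1)$ begins with 1, so each iterate is a prefix of the next.

```python
# The substitution of this section, written as a letter-to-word table.
SIGMA = {"1": "12", "2": "312", "3": "3312"}


def iterate(seed: str, steps: int) -> str:
    """Apply the substitution `steps` times to `seed`."""
    word = seed
    for _ in range(steps):
        word = "".join(SIGMA[c] for c in word)
    return word


def return_words(u: str, w: str) -> set[str]:
    """Return words over w that are visible in the finite word u."""
    pos = [i for i in range(len(u) - len(w) + 1) if u.startswith(w, i)]
    return {u[i:j] for i, j in zip(pos, pos[1:])}


x = iterate("1", 4)                    # prefix of the fixed point, 41 letters
print(x[:14])                          # 12312331212312
print(sorted(return_words(x, "23")))   # ['231', '23121', '2331', '233121']
```

Four distinct return words over 23 already appear in this short prefix, which is enough to rule out property $R_3$.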

4 Codings of rotation 

The aim of this section is to introduce codings of irrational rotation on
the unit circle. For $p \ge 2$, let $F = \{\beta_0 < \beta_1 < \cdots < \beta_{p-1}\}$ be a set of
$p$ consecutive points of the unit circle (identified in all that follows with
$[0, 1[$ or with the unidimensional torus $\mathbb{R}/\mathbb{Z}$) and let $\beta_p = \beta_0$. Let $\alpha$ be an
irrational number in $]0, 1[$ and let us consider the positive orbit of a point
$x$ of the unit circle under the rotation by angle $\alpha$, i.e., the set of points
$\{\{\alpha n + x\}, n \in \mathbb{N}\}$.

The coding of the orbit of $x$ under the rotation by angle $\alpha$ with respect to
the partition $\{[\beta_0, \beta_1[, [\beta_1, \beta_2[, \ldots, [\beta_{p-1}, \beta_p[\}$ is the infinite word $u$ defined
on the finite alphabet $\Sigma = \{0, \ldots, p-1\}$ as follows:

$$u_n = k \iff \{x + n\alpha\} \in [\beta_k, \beta_{k+1}[, \quad \text{for } 0 \le k \le p-1.$$

A coding of the rotation $R$ means the coding of the orbit of a point $x$ of the
unit circle under the rotation $R$ with respect to a finite partition of the unit
circle consisting of left-closed and right-open intervals.
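
The coding rule can be sketched in Python as follows. This is an illustration only: the function names are hypothetical, the computation uses floating-point arithmetic (so it is reliable only while orbit points stay away from the $\beta_k$ by much more than rounding error), and the example takes the golden-ratio angle with the two-interval partition $\beta_0 = 0$, $\beta_1 = 1 - \alpha$, which gives a Sturmian word rather than one of the three-interval codings studied later.

```python
import math
from bisect import bisect_right


def rotation_coding(alpha, betas, x, length):
    """First `length` letters of the coding of the orbit of x.

    betas = [beta_0, ..., beta_{p-1}] is sorted in [0, 1); the letter k
    means that {x + n*alpha} lies in [beta_k, beta_{k+1}[, with
    beta_p identified with beta_0 on the circle.
    """
    word = []
    for n in range(length):
        point = (x + n * alpha) % 1.0
        # Index of the last beta_k <= point; a point below beta_0 wraps
        # around into the last interval [beta_{p-1}, beta_p[.
        k = bisect_right(betas, point) - 1
        word.append(k % len(betas))
    return word


def factor_count(word, n):
    """Number of distinct factors of length n seen in a finite word."""
    return len({tuple(word[i:i + n]) for i in range(len(word) - n + 1)})


alpha = (math.sqrt(5) - 1) / 2          # assumed example angle
u = rotation_coding(alpha, [0.0, 1 - alpha], 0.0, 1000)
print("".join(map(str, u[:10])))        # 0101101011
print([factor_count(u, n) for n in range(1, 6)])  # [2, 3, 4, 5, 6]
```

The factor counts $n + 1$ observed on this prefix are the Sturmian complexity recalled in the introduction.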



4.1 Factors 

With the above notation, consider a coding $u$ of the orbit of a point $x$ under
the rotation by angle $\alpha$ with respect to the partition
$\{[\beta_0, \beta_1[, [\beta_1, \beta_2[, \ldots, [\beta_{p-1}, \beta_p[\}$.
Let $I_k = [\beta_k, \beta_{k+1}[$ and let $R$ denote the rotation by angle $\alpha$.

Lemma 1 A finite word $w_1 \cdots w_n$ defined on the alphabet $\Sigma = \{0, 1, \ldots, p-1\}$ is a factor
of the infinite word $u$ if and only if

$$I(w_1, \ldots, w_n) = \bigcap_{j=0}^{n-1} R^{-j}(I_{w_{j+1}}) \neq \emptyset.$$



Proof A finite word $w_1 \cdots w_n$ defined on the alphabet $\Sigma = \{0, 1, \ldots, p-1\}$
is a factor of the infinite word $u$ if and only if there exists an integer $k$ such
that

$$\{x + k\alpha\} \in I(w_1, \ldots, w_n) = \bigcap_{j=0}^{n-1} R^{-j}(I_{w_{j+1}}).$$

As $\alpha$ is irrational, the sequence $(\{x + n\alpha\})_{n \in \mathbb{N}}$ is dense in the unit circle,
which implies that $w_1 w_2 \ldots w_n$ is a factor of $u$ if and only if $I(w_1, \ldots, w_n) \neq \emptyset$.
In particular, the set of factors of a coding does not depend on the initial
point $x$ of this coding. Furthermore, the connected components of these sets
are bounded by the points

$$\{k(1 - \alpha) + \beta_i\}, \quad \text{for } 0 \le k \le n-1, \; 0 \le i \le p-1.$$

These sets consist of finite unions of intervals. More precisely, if for every
$k$, $\beta_{k+1} - \beta_k \le \sup(\alpha, 1 - \alpha)$, then these sets are connected; if there exists
$K$ such that $\beta_{K+1} - \beta_K > \sup(\alpha, 1 - \alpha)$, then the sets are connected except
for $w_1 \ldots w_n$ of the form $a_K^n$ (see |]l|] ) (the notation $a_K^n$ denotes the word of
length $n$ obtained by successive concatenations of the letter $a_K$). Let us note
that there exists at most one integer $K$ satisfying $\beta_{K+1} - \beta_K > \sup(\alpha, 1 - \alpha)$.



5 Interval exchange transformations 

This section deals with interval exchange transformations. This object is
a natural generalization of the codings of rotation by $\alpha$ where $\alpha$ is the length
of one of the intervals. An interval exchange transformation is a piecewise
affine transformation which maps a partition of the space into intervals to
another partition according to a permutation. This transformation can be
more complicated than a rotation. Indeed, irrational rotations are uniquely
ergodic, whereas some interval exchange transformations are not uniquely
ergodic. In the sequel, we use the notations of Keane and Rauzy (see [18, 19]).



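An interval exchange of $k$ intervals is defined by a vector $(\lambda_1, \lambda_2, \ldots, \lambda_k)$
in $\mathbb{R}^k$ with strictly positive coordinates and $\sum_{i=1}^{k} \lambda_i = 1$, and a permutation
$\sigma$ of the set $\{1, 2, \ldots, k\}$. We set the partition of $[0, 1[$ in $k$ intervals

$$X_i = \Big[ \sum_{j<i} \lambda_j, \; \sum_{j \le i} \lambda_j \Big[ \quad \text{for } 1 \le i \le k$$

($X_i$ has the length $\lambda_i$).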

The interval exchange transformation associated to the ordered pair
$(\sigma, \lambda)$ is the transformation from $[0, 1[$ to itself, defined as a piecewise affine
transformation which maps the partition $(X_1, X_2, \ldots, X_k)$ to the partition
$(X_{\sigma(1)}, X_{\sigma(2)}, \ldots, X_{\sigma(k)})$.
The transformation $T$ maps the point $x \in X_i$ to the point

$$T(x) = x + \alpha_i$$

where

$$\alpha_i = \sum_{j < \sigma^{-1}(i)} \lambda_{\sigma(j)} - \sum_{j < i} \lambda_j.$$

Now, we construct an infinite word with values in the alphabet $\{1, 2, \ldots, k\}$
associated with a pair $(\sigma, \lambda)$ by coding the positive orbit of a point $x$ by the
transformation $T$ according to the partition $(X_1, X_2, \ldots, X_k)$. The infinite
word $U(x)$ is given by

$$U(x)_n = \mathcal{I}(T^n(x)), \quad \forall n \in \mathbb{N}$$

where $\mathcal{I}(y) = i$ if $y \in X_i$.
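
The construction of $T$ and of the coding can be sketched in Python. This is a hedged illustration: the names `make_iet` and `coding` are hypothetical, the permutation is read as "after the exchange the intervals appear in the order $X_{\sigma(1)}, \ldots, X_{\sigma(k)}$" (the reading consistent with the formula for $\alpha_i$), and the example uses rational lengths with exact `Fraction` arithmetic. Rational lengths give a periodic orbit, so this example is not a regular exchange; it only exercises the definitions.

```python
from fractions import Fraction
from itertools import accumulate


def make_iet(lengths, sigma):
    """Build the exchange T and the coding map for (sigma, lengths).

    lengths = [lambda_1, ..., lambda_k]; sigma is 1-based, sigma[j-1]
    being the index of the interval placed at position j after the
    exchange.
    """
    k = len(lengths)
    left = [0] + list(accumulate(lengths))   # left[i-1] = left endpoint of X_i
    new_left, pos = {}, 0
    for j in range(k):
        i = sigma[j]
        new_left[i] = pos                    # sum of lambda_sigma(j') for j' < j
        pos += lengths[i - 1]
    shift = {i: new_left[i] - left[i - 1] for i in range(1, k + 1)}  # alpha_i

    def letter(y):
        """Index i such that y lies in X_i = [left[i-1], left[i])."""
        for i in range(1, k + 1):
            if left[i - 1] <= y < left[i]:
                return i
        raise ValueError("point outside [0, 1[")

    def T(y):
        return y + shift[letter(y)]

    return T, letter


def coding(T, letter, x, length):
    """First `length` letters of U(x): the letters of x, T(x), T^2(x), ..."""
    word = []
    for _ in range(length):
        word.append(letter(x))
        x = T(x)
    return word


lengths = [Fraction(1, 2), Fraction(1, 3), Fraction(1, 6)]
T, letter = make_iet(lengths, [3, 2, 1])
print(T(Fraction(0)), T(Fraction(1, 2)), T(Fraction(5, 6)))  # 1/2 1/6 0
print(coding(T, letter, Fraction(0), 7))                     # [1, 2, 1, 2, 1, 3, 1]
```

For lengths that are rationally independent one would have to switch to floating-point or symbolic numbers; that choice is left open here.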

In order to define a transformation such that each orbit is dense in $[0, 1[$,
we add the following property: an interval exchange transformation is called
regular if for all endpoints $0 = a_1 < a_2 < \cdots < a_{k+1} = 1$ of the intervals
$X_i = [a_i, a_{i+1}[$ with $i \in \{1, \ldots, k\}$, we have

$$T^n a_i = a_j, \quad i \text{ and } j \in \{2, 3, \ldots, k\}, \; n \in \mathbb{Z}$$

implies $n = 0$, $i = j$.

M. Keane shows the following result [18]:

Theorem 1 An interval exchange transformation $T$ is regular if and only
if for every point $x$ in $[0, 1[$ the orbit $\bigcup_{n \in \mathbb{Z}} \{T^n x\}$ of $x$ by $T$ is dense in $[0, 1[$.



A necessary condition to have a regular interval exchange transformation
is to take an irreducible permutation. A permutation is called
irreducible if no subset of $\{1, 2, \ldots, k\}$ is invariant by the permutation, i.e.,
$\sigma(\{1, 2, \ldots, \ell\}) \neq \{1, 2, \ldots, \ell\}$ for all $\ell < k$.

Now we can state the main theorem of this section: 

Theorem 2 Let $T$ be a regular interval exchange transformation with $k$
intervals. The infinite word $U(x)$ associated with $T$ has property $R_k$.

In order to show this theorem, we first present a method to construct all
the words of length $n$ that are factors of the infinite word $U(x)$, and secondly a way
to construct return words by using self-induction.

5.1 Factors 

The construction of all the factors is not so far from the one used for rotations
on the unit circle. The main tool is to consider the negative orbit of all the
endpoints $0 = a_1 < a_2 < \cdots < a_{k+1} = 1$ of the intervals $X_i = [a_i, a_{i+1}[$ with
$i \in \{1, \ldots, k\}$. The set of endpoints $\{a_i \mid 1 \le i \le k+1\}$ is called $X^{(1)}$.

A word $w$ of length $n$ is a factor of the infinite word $U(x)$ if and only if
there exists an interval $I_w \subset [0, 1[$ and a point $y$ in $I_w$ such that the word $w$
is the following concatenation of letters:

$$\mathcal{I}(y)\,\mathcal{I}(T(y)) \cdots \mathcal{I}(T^{n-1}(y)) = w.$$

Using this fact, the number of factors of length one is exactly the number
of intervals $I_1 = X_1, I_2 = X_2, \ldots, I_k = X_k$ associated with the letters of the
alphabet $\{1, 2, \ldots, k\}$.



Proposition 1 Let $T$ be a regular interval exchange transformation with $k$
intervals. The infinite word $U(x)$ associated with $T$ has complexity
$p(n) = n(k-1) + 1$, $\forall n \in \mathbb{N}$.

Proof The factors $w = w_1 w_2$ of length 2 are given by the set
$\{\mathcal{I}(x)\,\mathcal{I}(T(x)), x \in [0, 1[\}$ or equivalently by the intervals
$I_{w_1 w_2} = I_{w_1} \cap T^{-1} I_{w_2}$.

As the transformation $T$ is a piecewise affine transformation, it is
sufficient to find all the endpoints of the intervals $I_w$ with $|w| = 2$, that is,
the positions of the endpoints of all intervals of the form $I_{w_1} \cap T^{-1} I_{w_2}$. These
endpoints are given by the ordered set

$$X^{(2)} = \{0 = b_1 < b_2 < \cdots < b_{2k} = 1\} = X^{(1)} \cup T^{-1} X^{(1)}.$$

Remark that the number of points in the union is $2k$ because the points
0 and 1 are elements of the intersection of the two sets $X^{(1)}$ and $T^{-1} X^{(1)}$.

The next step is to prove that each $I_w$ is connected. By definition, all
the intervals associated with words of length 1 are connected. The interval
$I_{w_1 w_2} = I_{w_1} \cap T^{-1} I_{w_2}$ is connected because it is the intersection of two
intervals in $[0, 1[$.

In other words, for each word $w$ with $|w| = 2$ factor of the infinite word
$U(x)$, there exists an interval $I_w = [b_\ell, b_{\ell+1}[$ where $b_\ell$ and $b_{\ell+1}$ are two
consecutive points in the ordered set $X^{(2)}$.



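By induction, let $w = w_1 w_2 \cdots w_n$ be a factor of $U(x)$. There exists
a point $x$ such that

$$w = \mathcal{I}(x)\,\mathcal{I}(T(x)) \cdots \mathcal{I}(T^{n-1}(x))$$

or equivalently

$$I_{w_1 w_2 \cdots w_n} = \bigcap_{i=0}^{n-1} T^{-i} I_{w_{i+1}}.$$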



That is, each endpoint of the interval associated with a word of length $n$ is
an element of the ordered set

$$X^{(n)} = \{0 = b_1 < b_2 < \cdots < b_{n(k-1)+2} = 1\} = X^{(1)} \cup T^{-1} X^{(1)} \cup \cdots \cup T^{-n+1} X^{(1)}.$$

Remark that the number of points in the union is $n(k-1) + 2$ because at
each step, we add $k - 1$ new points and for $n = 1$ we have $k + 1$ points in
the partition.

Furthermore, the intervals associated with factors of length $n$ are
connected. Indeed, the intervals are intersections of connected intervals in
$[0, 1[$.

In other words, for each word $w$ with $|w| = n$ factor of the infinite
word $U(x)$, there exists an interval $I_w = [b_\ell, b_{\ell+1}[$ where $b_\ell$ and $b_{\ell+1}$ are two
consecutive points in the ordered set $X^{(n)}$.

With this construction, we find $n(k-1) + 2$ points for the endpoints
in the partition $X^{(n)}$. We have $n(k-1) + 1$ intervals in the partition and
then the complexity function for a regular interval exchange in $k$ intervals
is equal to $p(n, k) = n(k-1) + 1$, $\forall (n, k) \in \mathbb{N}^2$. ■

Thus an infinite word associated with a regular interval exchange with
two intervals is nothing but a Sturmian infinite word (i.e. an infinite word
with complexity $p(n, 2) = n + 1$ for all $n$). And an infinite word associated
with a regular interval exchange with three intervals is an infinite word with
complexity $p(n, 3) = 2n + 1$ for all $n$.



5.2 Self-induction

Now, we focus on the construction of return words associated with a word
$w$ factor of the infinite word.

Theorem 2 Let $T$ be a regular interval exchange transformation with $k$
intervals. The infinite word $U(x)$ associated with $T$ has property $R_k$.

By the previous construction, we find a unique connected interval $I_w$
associated with the word $w$. The main tool is to study the first return map
in the closure of the interval $I_w$. This method is used by Rauzy to give a
generalized continued fraction algorithm [19,



Proof Let $r(y) = \inf_{t > 0}\{T^{-t} y \in I_w\}$ be the negative first return time in
the interval $I_w$. Keane shows that the points $\{T^{-r(y)} y \mid y \in I_w, r(y) < \infty\}$
give a partition of the interval $I_w$ in exactly $k$ intervals $I_{p_1}, I_{p_2}, \ldots, I_{p_k}$ (see
[18]). Indeed, the endpoints of these intervals are given by the first time that
the negative orbit of the points in $X^{(1)} = \{0 = a_1 < a_2 < \cdots < a_{k+1} = 1\}$
falls in $I_w$.

There are $k + 1$ points in $X^{(1)}$. This shows that for a general interval
$[\alpha, \beta]$, the number of induced points in $]\alpha, \beta[$ is $k + 1$ and that the number
of induced intervals is $k + 2$. Here, however, the endpoints of $I_w$ both lie in the negative
orbits of two different points in $X^{(1)}$. More precisely, as the interval exchange
transformation is regular, we have

$$T^n a_i = a_j, \quad i \text{ and } j \in \{2, 3, \ldots, k\}, \; n \in \mathbb{Z}$$

implies $n = 0$, $i = j$. Thus

$$\alpha = T^{-n_0} a_0, \qquad \beta = T^{-n_{k+1}} a_{k+1}.$$

That is, the number of induced points in the interior of $I_w$ is $k - 1$.
Thus the number of induced intervals is $k$. Such intervals with endpoints on
the negative orbit of $a_0$ and $a_{k+1}$ are called acceptable intervals by Rauzy
(see [19]). Indeed, for an interval exchange transformation with $k$ intervals,
the induced transformation on an acceptable interval is also an interval
exchange transformation with $k$ intervals.

By construction, for each $p_i$ there exists $y \in I_w$ and $t$ such that the word
$p_i = \mathcal{I}(y)\,\mathcal{I}(T(y)) \cdots \mathcal{I}(T^{t}(y)) \cdots \mathcal{I}(T^{t+|w|-1}(y))$. Furthermore, the prefix of
length $|w|$ of $p_i$ is exactly $w$ (indeed $y \in I_w$) and the suffix of length $|w|$
of $p_i$ is exactly $w$ (indeed $t$ is the positive first return time of $y$ in the
interval $I_w$ defined by $r^{+}(y) = \inf_{t > 0}\{T^{t} y \in I_w\}$). In other words, the
number of return words over $w$ is exactly the number of induced intervals.
By construction $I_w$ is an acceptable interval. Thus the number of return
words over $w$ is exactly $k$. That is, the infinite word associated with a regular
interval exchange transformation on $k$ intervals has property $R_k$. ■

Corollary 1 The length of the $i$-th return word of $w$ associated with the $i$-th
induced interval $p_i$ is $|w| + k_i + k'_i$, where $k_i$ is given by the smallest $k_i \ge 0$
such that $T^{-k_i} \ell_i \in I_w$, where $k'_i$ is given by the smallest $k'_i > 0$ such that
$T^{k'_i} \ell_i \in I_w$, and where $\ell_i$ is the left endpoint of $p_i$.

Proof By construction, the left endpoint of $p_1$ is $\alpha$. Each orbit of the
points in $p_1$ according to the induced transformation is a translation of the
orbit of $\alpha$. Then the length of the return word associated with $p_1$ is $|w| + k'_1$
where $k'_1$ is the smallest $k'_1 > 0$ such that $T^{k'_1} \alpha \in I_w$. In other words, the
length of the return word associated with the interval $p_1$ is exactly the sum
of the positive first return time $k'_1$ of the point $\alpha$ in $I_w$ and the length of $w$.
By construction, each interval $p_i$ for $i = 1, \ldots, k$ has $T^{-k_i} \ell_i$ for left
endpoint. Furthermore, each orbit of the points in $p_i$ according to the induced
transformation is a translation of the orbit of $T^{-k_i} \ell_i$. Then the length of
the return word associated with $p_i$ is $|w| + k_i + k'_i$ where $k_i$ is the smallest
$k_i \ge 0$ such that $T^{-k_i} \ell_i \in I_w$ and where $k'_i$ is the smallest $k'_i > 0$ such that
$T^{k'_i} \ell_i \in I_w$. In other words, the length of the return word associated with
the interval $p_i$ is exactly given by the sum of the first time that $T^{-k_i} \ell_i$ is
mapped on the point $\ell_i$ ($\ell_i$ is an element of the partition $X^{(1)}$), the positive
first return time $k'_i$ of the point $\ell_i$ in $I_w$, and the length of $w$. ■

6 Return words for codings of rotation 

As the codings of rotation are a special case of interval exchange transfor- 
mations, we have the following corollary: 

Corollary 2 Let $T$ be a coding of rotation on $k$ intervals with rationally
independent lengths, by a rotation of angle $\alpha$ equal to the length of one
of the intervals. The infinite word $U(x)$ associated with $T$ has property $R_k$.

Proof As the rotation is defined on the circle, we can translate all the
intervals in order to have the first interval with length equal to $\alpha$. The
coding of rotation of angle $\alpha$ on $k$ intervals with the first interval of length
$\alpha$ is equivalent to an interval exchange transformation with either the
permutation $(2\,3 \cdots k\,1)$ if $\alpha > 0$ or the permutation $(k\,1\,2\,3 \cdots k-1)$ if $\alpha < 0$.

As the lengths of the intervals are rationally independent in the coding
of rotation, the associated interval exchange transformation is regular.
Consequently, the infinite word $U(x)$ associated with $T$ has property $R_k$. ■